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Introduction 

Tumors  exhibit  abnormal  epigenetic  regulation  (1).  A  key  epigenetic  event  is 
DNA  methylation,  typically  at  the  dinucleotide  CpG.  Both  hypomethylation  and 
hypermethylation  of  DNA  is  observed  in  cancer  cells.  Generally,  the  former  leads  to 
activation  of  protooncogenes  and  the  latter  to  repression  of  tumor  suppressors  during 
the  initiation  and  progression  of  cancer.  The  Methyl-CpG-Binding  Domain  (MBD) 
family  of  proteins  is  among  the  ones  that  bind  methylated  DNA  directly  and  this 
association  is  critical  in  regulating  the  outcome  of  DNA  methylation  (2,3).  The  MBD 
family  includes  proteins  named  MBD1,  MBD2,  MBD4  and  MeCP2.  (MBD3  contains 
the  MBD  but  does  not  bind  methylated  DNA  directly.)  The  molecular  events  leading 
from  DNA  methylation  or  demethylation  to  gene  regulation  during  cancer  progression 
are  poorly  defined.  In  this  study,  the  binding  of  MBD  proteins  throughout  the  entire 
genome  of  several  malignant  prostate  tumors  and  matched  normal  or  benign  tissues 
will  be  identified.  This  provides  novel  fundamental  information  about  epigenetic 
regulation  during  carcinogenesis.  Also,  the  differences  in  these  patterns  will  identify 
new  genes  regulated  by  MBD  binding  and  DNA  methylation  during  prostate  cancer 
progression.  These  genes  potentially  identify  novel  biomarkers  or  therapeutic  targets. 
The  identity  of  bound  MBD  protein  will  facilitate  subsequent  studies  that  examine  the 
“downstream”  molecular  events  at  a  specific  gene  during  progression  of  prostate 
cancer. 

Body 

As  a  brief  overall  summary,  the  genomic  profiles  of  MBD  proteins  in  malignant 
and  matched  normal  or  benign  prostate  tissue  was  examined  by  Chromatin 
Immunoprecipitation  (ChIP)  followed  by  Next  Generation  Sequencing  (NGS  or  Seq). 
The  details  of  the  ChIP-Seq  experiments  are  described  below.  The  subsequent 
bioinformatics  analysis  of  this  project  was  performed  to  identify  genes  potentially 
targeted  and  regulated  by  MBD  proteins  -  described  below.  This  downstream 
informatics  analysis  was  demanding  because  MBD  proteins  are  associated  with  the 
genome  over  wide  regions,  a  major  difference  from  “typical”  DNA-binding  proteins 
such  as  CREB  or  C/EBP  which  bind  relatively  small  and  discrete  recognition  sites 
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and  the  bioinfomatic  analysis  required  increased  effort.  The  pattern  of  MBD  binding  is 
similar  to  that  observed  with  many  histone  modifications,  another  form  of  epigenetic 
regulation.  One  thing  I  have  learned  from  this  study  is  that  additional  information  can 
be  obtained  and  questions  addressed  with  continued  bioinformatic  approaches, 
almost  ad  infinitum,  so  this  analysis  will  be  a  continuing  process.  In  the  following 
summary,  I  have  attempted  to  include  representative  and  important  analyses. 

The  first  task  includes  the  Chromatin  Immunoprecipitations  (ChIP)  and  Next 
Generation  Sequencing  (NGS  or  Seq)  experiments.  Five  sets  of  matched  malignant 
prostate  carcinomas  and  normal/benign  prostate  tissues  were  obtained  from  the 
Cooperative  Human  Tissue  Network  (http://www.chtn.nci.nih.gov).  These  tissues 
were  divided  into  two  portions.  The  larger  portion  was  prepared  for  ChIP  experiments 
using  the  truChIP  Tissue  Chromatin  Shearing  Kit  (Covaris,  Woburn,  MA).  (The 
smaller  portion  of  tissue  was  used  for  isolation  of  RNA  -  see  below.)  Briefly,  the 
tissues  were  minced,  treated  with  formaldehyde  and  this  reaction  was  quenched  per 
manufacturer’s  suggested  protocol.  Each  tissue  was  disrupted  and  the  separated 
cells  that  resulted  were  lysed.  The  nuclei  were  collected  by  centrifugation,  then 
sonicated  to  lyse  the  nuclei  and  shear  the  chromatin  in  an  Adaptive  Focused 
Acoustics  sonicator  (Covaris).  Chromatin  immunoprecipitations  were  carried  out 
using  antibodies  that  recognize  MeCP2,  MBD1,  MBD2  and  MBD4  and  the  ChIP-IT 
High  Sensitivity  Kit  (Active  Motif,  Carlsbad,  CA).  The  resulting  “ChIP  DNA”  was 
quantified  using  a  Cubit  fluorometer  (Life  Technologies,  Grand  Island,  NY),  then 
concentrated  and  purified  by  ethanol  precipitation  using  linear  acrylamide  as  a  carrier 
and  quantified  again  at  the  end  using  the  Cubit  fluorometer.  The  amounts  of  DNA 
obtained  after  the  ChIP  and  ethanol  precipitation  made  the  next  step,  the  preparation 
of  libraries  for  Next-Generation  Sequencing,  technically  challenging  and  required 
more  time  than  allotted  in  the  “Statement  of  Work”.  This  delay  led  to  the  request  and 
approval  of  a  no-cost  extension  on  the  grant. 

The  SeqPlex  DNA  Amplification  Kit  (Sigma,  St  Louis,  MG)  was  used  to 
generate  the  libraries  for  NGS  following  the  manufacturer’s  suggested  protocol.  ChIP 
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DNA  from  three  matched  sets  of  malignant  prostate  cancer  and  normal/benign 
prostate  tissue  were  sequenced  using  the  Ion  Torrent  (Life  Technologies). 
Approximately  1  million  sequencing  “reads”  were  performed  for  each  sample.  With 
additional  resources,  a  higher  number  of  reads  per  sample  might  increase  the 
number  of  identifiable  peaks  (that  reach  the  appropriate  statistical  threshold). 

The  second  task  will  be  brought  into  the  discussion  at  this  point,  prior  to  the 
bioinformatics  analysis  portion  of  Task  1,  because  the  information  from  the 
microarrays  performed  in  the  second  task  was  used  in  the  bioinfomatic  analysis.  In 
brief,  RNA  was  isolated  from  the  prostate  tissues,  a  microarray  was  performed  to 
identify  genes  that  are  differentially  expressed  in  malignant  prostate  tissues  and  in 
the  bioinformatic  analysis,  this  information  was  correlated  with  binding  of  MBD 
proteins.  An  mRNA  fraction  was  isolated  from  each  tissue  using  an  RNeasy  Kit  and  a 
Qiacube  (Qiagen  USA,  Valencia,  CA).  (In  addition,  microRNA  fractions  have  been 
isolated  from  two  sets  of  matched  tissues  to  expand  this  study  to  include  microRNAs 
in  the  future.)  Equal  amounts  of  mRNA  from  three  normal  prostate  tissues  and  from 
three  stage  3  malignant  prostate  carcinomas,  respectively,  were  mixed  and  assayed 
using  an  Affymetrix  microarray.  Using  this  pooled  set  of  mRNAs,  the  microarray 
experiment  was  used  to  generate  a  list  of  genes  that  are  increased  >2-fold  and 
decreased  >2-fold  in  the  stage  3  prostate  cancer  tissue  relative  to  normal  tissue  (see 
Tables  1  and  2  in  Supporting  Data  for  98  most  up-regulated  and  down-regulated 
genes).  Genes  of  interest,  namely  their  expression  levels,  will  need  to  be  validated 
by  reverse  transcriptase-quantitative  PCR  (RT-qPCR)  using  mRNAs  from  matched 
sets  of  tissues.  The  threshold  of  2-fold  is  arbitrary  and  gene  lists  using  different 
thresholds  can  be  generated  easily. 

The  bioinformatics  analysis  (part  of  Task  1)  was  used  to  address  different 
questions.  As  a  brief  introduction  of  the  bioinfomatic  analysis,  “alignment”  refers  to 
the  process  of  identifying  (or  aligning)  the  sequence  reads  against  the  reference 
genome,  in  this  case  the  hg19  compilation  of  the  human  genome.  This  process 
records  the  sequencing  reads  attributable  to  each  genomic  region.  After  the 
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alignments  are  performed,  they  are  compared  between  MBD-family  ChIPs  and  either 
input  DNA  or  an  IgG-control  ChIP  to  identify  “peaks”  or  “regions”  of  aligned  sequence 
reads  within  the  genome.  The  chromosomal  locations  and  corresponding  genes 
within  those  peak  regions  are  analyzed  to  identify  potential  MBD  targets. 

In  one  analysis,  the  alignments  produced  by  the  software  of  the  Ion  Torrent 
sequencer  were  used.  These  results  were  then  processed  (in  collaboration  with  H. 
Chen)  in  a  software  package  called  RSEG  (4)  to  identify  the  boundaries  of  the  peaks 
from  the  ChIP-Seq  experiments.  RSEG  was  developed  to  identify  the  boundaries  of 
broad,  less  discrete  peaks  such  as  those  associated  with  histone  modification.  This 
property  makes  it  useful  for  the  analysis  of  MBD  binding  profies.  The  peaks  identified 
by  RSEG  were  analyzed  using  the  tools  within  the  Galaxy  website 
(https://usegalaxy.org)  to  identify  genes  that  overlap  the  peaks.  An  alternative 
analysis  is  using  the  commercially  available  Genomics  Suite  -  ChIP-Seq  workflow 
(Partek,  St  Louis  MO),  which  has  been  used  in  other  publications  (4,  5).  The  analysis 
using  RSEG/Galaxy  and  Genomics  Suite  (Partek)  will  be  described  below. 

The  bioinformatics  is  an  on-going  task.  The  analysis  above  is  being  continued 
to  extract  additional  information  and  results  for  publication.  The  bioinformatics 
analysis  (in  collaboration  with  Q  Tran)  is  also  being  performed  in  a  third  way  for 
comparison  using  Bowtie  (www.bowtie-bio.sourceforqe.net)  to  perform  the 
alignments  and  MACS  (http://liulab.dfci.harvard.edu/MACS/)  to  identify  the  peaks. 
This  is  ongoing  and  not  described  below. 

The  analysis  of  the  MBD1  ChIP-Seq  experiments  using  RSEG  and  Galaxy  to 
identify  and  analyze  **maliqnant-specific**  MBD1  ChIP-Seq  peaks  is  summarized 
next.  MBD1  ChIP-SEQ  was  performed  using  three  matched  sets  of  malignant  and 
normal/benign  prostate  tissues.  As  described  above,  RSEG  was  used  to  identify  the 
MBD1  ChIP-Seq  peaks  from  matched  malignant  and  normal  or  benign  tissues.  Using 
Galaxy,  the  MBD1  ChIP-Seq  peaks  specific  to  the  malignant  tissue  of  each  sample 
were  identified  by  subtracting  the  peaks  from  the  normal  or  benign  tissue  from  the 
peaks  present  in  the  matched  malignant  prostate  tissue.  These  resulting  peaks  from 
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each  of  three  sets  of  samples  were  compared  and  32  malignant-specific  MBD1 
peaks  common  to  all  three  samples  were  identified  (see  Table  3).  There  is  a  large 
difference  in  the  number  of  common  peaks  identified  when  all  three  samples  are 
compared  versus  the  pair-wise  comparisons.  When  the  malignant-specific  MBD1 
peaks  were  compared  in  a  pair-wise  manner,  164,  94  and  483  common  peaks  (or 
genomic  regions)  were  identified  (list  of  peaks  not  shown).  The  malignant-specific 
MBD1  peaks  common  to  all  three  samples  were  analyzed  to  identify  regions 
containing  known  genes.  Of  these  32  peaks,  most  are  gene-containing  region  and 
this  analysis  yielded  31  malignant-specific  peaks  (see  Table  4).  These  31  gene- 
containing  regions  were  examined  using  Integrated  Genome  Browser  (IGB; 
www.bioviz.org).  Twenty-six  of  these  regions  contain  6  genes  or  less  (data  not 
shown).  The  relative  expression  level  of  the  genes  from  these  26  regions  in 
malignant  versus  normal  prostate  tissues  was  compared  using  the  complete 
microarray  results  described  above.  Of  these  26  regions,  nineteen  contain  just  one  or 
two  genes.  (Note:  One,  some  genes  within  these  26  regions  were  not  found  within 
the  microarray  and  levels  could  not  be  determined  from  that  experiment.  Two,  the 
regions  beyond  the  26  above  were  not  investigated  further  because  they  contain 
between  10  and  33  genes.  The  large  number  of  genes  in  these  regions  greatly 
diminishes  the  value  of  the  analysis.)  Of  the  genes  overlapping  the  malignant-specific 
MBD1  peaks  common  to  all  three  tissue  samples,  five  genes  had  decreased  levels  of 
RNAin  malignant  tissues:  SLC22A15,  FAM110C,  PLEKHG4B,  PDCD6  and  HRSP12 
(listed  in  Table  4).  Decreased  expression  was  defined  as  RNA  level  less  than  80%  of 
that  observed  in  normal  tissues.  Of  these  investigated  genes,  six  genes  had  at  least 
1.4-fold  increased  RNA  level  in  malignant  relative  to  normal  tissues:  CASQ2,  DPF3, 
KIAA0226,  CCDC127,  PTPRN2  and  snoRNA72  (listed  Table  4).  As  a  reference 
point,  DHFR  RNA  level  was  increased  by  1 .2-fold  in  the  microarray.  Some  of  the 
noteworthy  features  of  these  genes  include  the  role  of:  FAM1 1 0C  in  the  Akt 
signaling  pathway  and  prostatitis,  PLEKFIG4  in  phospholipid  binding  and  Rho  guanyl- 
nucleotide  exchange  factor  activity  and  PDCD  in  programmed  cell  death,  DPF3  as  a 
subunit  of  BAF  -  a  chromatin  remodeling  complex,  KIAA0226’s  homology  to 
PLEKFIM1L,  a  plekstrin-homology  domain  contain  protein  and  PTPRN2  as  a  receptor 
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with  tyrosine  phosphatase  activity.  Additional  literature  research  investigating  protein 
function  and  activity  is  ongoing.  Additional  experiments  to  evaluate  their  expression 
level  in  the  matched  tissues  will  be  performed. 

Many  questions  can  be  asked  with  this  ChIP-Seq  data.  For  example,  the 
genomic  regions  containing  the  MBD1  peaks  in  each  INDIVIDUAL  tissue  (as 
opposed  to  the  analysis  above  examining  what  is  common  to  all  three  sets)  can  be 
analyzed  for  the  genes  present  and  expression  levels  of  those  genes.  As  another 
example,  the  peaks  common  to  only  two  tissues  can  be  obtained  by  carrying  out  the 
pair-wise  comparison  of  ChIP-Seq  peaks.  Based  on  the  numbers  listed  above,  this  is 
less  stringent  but  could  identify  genes  that  are  important  to  the  process  of  prostate 
cancer. 

A  similar  analysis  using  RSEG  and  Galaxy  was  used  to  examine  the  MeCP2 
ChIP-Seq  experiments  and  identify  and  analyze  maliqnant-specific  MeCP2  ChIP-Seq 
peaks.  MeCP2  ChIP-Seq  was  performed  using  the  same  three  matched  sets  of 
malignant  and  normal/benign  prostate  tissues  as  used  in  the  MBD1  study  above. 
RSEG  was  used  to  identify  the  boundaries  of  the  ChIP-Seq  peaks.  Using  Galaxy,  the 
maliqnant-specific  MeCP2  ChIP-Seq  peaks  for  each  sample  were  identified  as 
described  above.  These  resulting  peaks  from  each  of  three  sets  of  samples  were 
compared  and  38  malignant-specific  MeCP2  peaks  common  to  all  three  samples 
were  identified  (see  Table  5).  For  comparison,  when  the  malignant-specific  MeCP2 
peaks  were  compared  in  a  pair-wise  manner  between  samples,  there  were  1835,  462 
and  1815  common  peaks  were  identified  (peak  lists  not  shown).  The  malignant- 
specific  MeCP2  peaks  common  to  all  three  matched  samples  were  analyzed  to 
identify  regions  that  contained  known  genes  within  these  common  peaks.  Of  these 
38  peaks,  most  contained  genes  and  this  analysis  yielded  29  malignant-specific 
peaks  (see  Table  6).  These  29  genomic  regions  were  examined  using  Integrated 
Genome  Browser.  Twenty-three  of  these  regions  contain  7  genes  or  less  (data  not 
shown).  The  relative  expression  level  of  these  genes  in  malignant  versus  normal 
prostate  tissues  was  compared  using  the  complete  microarray  results  described 
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above.  (NOTE:  Of  these  23  regions,  fifteen  contain  just  one  or  two  genes.  The 
remaining  regions,  which  contain  between  11  and  20  genes,  will  not  be  analyzed 
further  at  this  time  because  the  large  number  of  genes.)  Of  the  genes  overlapping  the 
malignant-specific  MeCP2  peaks  common  to  all  three  tissue  samples,  five  genes  had 
decreased  levels  of  RNA  in  malignant  tissues:  SORC3,  C11orf45,  ACOXL,  GINS1 
and  GRIA1  (listed  in  Table  6).  Decreased  expression  was  defined  as  RNA  level  less 
than  80%  that  observed  in  normal  tissues.  Of  these  investigated  genes,  six  genes 
had  at  least  1 .4-fold  increased  RNA  level  in  malignant  relative  to  normal  tissues: 
RPS6KC1,  CDH11,  PTPRM,  LYPD1,  NCKAP5  and  PTPRT  (listed  in  Table  6).  Some 
of  the  noteworthy  features  of  these  genes  include  the  role  of:  SORC3  in  vacuole 
sorting,  the  association  of  ACOXL  with  leukemias,  GIN,  an  replication  initiation  factor 
which  contains  GINS1  as  a  subunit,  RPS6KC1  in  regulating  protein  synthesis, 
CDHITs  involvement  in  signaling  as  a  cadherin,  PTPRM  and  PTPRT  which  are  both 
receptors  containing  tyrosine  phosphatase  activity  and  LYPD1  (also  known  as  C4.4) 
in  tumor  metastasis.  Additional  experiments  to  examine  gene  expression  levels  and 
additional  literature  research  investigating  protein  function  are  needed. 

The  MBD4  ChIP-Seq  experiments  were  analyzed  in  order  to  identify  the 
malignant-specific  peaks.  In  this  case,  the  ChIP-Seq  data  from  just  two  of  three 
matched  tissue  sets  used  in  the  MBD1  and  MeCP2  studies  above  was  obtained. 
RSEG  and  Galaxy  were  used  as  described  above  to  identify  the  maliqnant-specific 
MBD4  ChIP-Seq  peaks  for  each  sample.  This  analysis  for  MBD4  will  be  described  in 
a  bit  more  detail  than  for  MBD1  and  MeCP2  to  provide  some  insight  into  the  number 
of  peaks  that  were  typically  identified  in  the  ChIP-Seq  experiments.  In  one  set  of 
matched  tissues,  1173  malignant-specific  peaks  were  identified  out  of  1,936  peaks 
identified  from  the  malignant  tissue.  In  the  second  set  of  matched  tissues,  760 
malignant-specific  peaks  were  identified  out  of  1,130  peaks.  When  the  malignant- 
specific  peaks  are  compared,  143  overlapping  malignant-specific  peaks  between 
these  two  tissues  were  identified  (see  Table  7).  The  genes  within  these  peaks  will  be 
identified  and  expression  will  be  examined  using  the  previously  discussed  microarray 
and  additional  experiments  such  as  RT-qPCR  or  microarrays. 
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MBD2  ChIP-Seq  data  was  also  obtained  for  only  two  of  three  matched  sets  of 
tissues  used  in  the  MBD1  and  MeCP2  studies  above.  The  analysis  was  performed  as 
described  for  MBD4.  The  overlapping  malignant-specific  peaks  will  be  analyzed  for 
overlapping  genes  and  gene  expression  will  be  examined  further. 

The  analysis  performed  using  Genomics  Suite  is  described  next.  The  first  part  of 
the  analysis  is  aimed  at  identifying  the  sites  where  the  methyl  CpG-binding  domains 
proteins  are  associated.  Briefly,  the  raw  high-throughput  sequencing  data  was 
aligned  against  a  reference  human  genome  (hg19)  to  identify  peaks  (also  called 
“regions”).  The  alignment  was  performed  in  pairs  -  experimental  ChIP  DNA  and  input 
DNA  from  malignant  or  normal/benign  samples.  The  peaks  that  passed  the  criterion 
of  a  false  discovery  rate  (FDR)  equaling  0.2  or  less  were  identified,  then  the  peaks 
found  in  the  experimental  ChIP  sample  or  not  and  those  found  in  the  input  DNA  or 
not  were  sorted.  The  peaks  that  were  found  in  the  experimental  ChIP  were  filtered  to 
find  the  ones  that  are  specific  to  the  ChIP  (that  is,  not  found  in  the  input).  The 
outcome  of  these  operations  is  to  identify  peaks  (i.e.,  regions  within  the  genome) 
associated  with  the  corresponding  methyl  CpG-binding  domain  protein  (MBD1, 

MBD2,  MBD4  or  MeCP2). 

MBD1  ChIP-Seq  results  were  obtained  from  three  matched  sets  of  samples  - 
(Sample  or  Patient)  #3,  #4  and  #6.  Using  the  Partek  Genomics  Suite  software,  the 
normal  tissue  samples  1897,  1491  and  1257  peaks  (also  called  “regions”), 
respectively,  were  identified  in  the  normal  tissues  and  1826,  1814  and  1355  peaks, 
respectively,  were  identified  in  the  malignant  tissues.  Of  the  1897  identified  peaks  (or 
“regions”)  found  in  the  normal  tissue  of  sample  #3,  1234  peaks  were  identified  in  the 
MBD1  Chip  and  1329  peaks  were  identified  in  the  input  DNA.  This  distribution  was 
roughly  similar  for  all  samples  (malignant,  normal  or  benign  and  each  ChIP  -  MBD1 , 
MBD2,  MBD4  and  MeCP2).  In  our  continued  analysis,  we  found  that  21 7  of  the  1 897 
peaks  from  Sample  #3  normal  tissue  passed  the  false  discovery  rate  (FDR).  Of  those 
217  peaks,  174  were  present  in  the  MBD1  ChIP-Seq  samples.  Of  those  174  peaks. 
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42  were  specific  to  the  MBD1  ChIP  (that  is,  the  peaks  were  NOT  present  in  the  Input 
DNA).  These  MBD1 -specific  peaks  are  listed  in  Table  8  and  are  predicted  to 
correspond  to  genomic  regions  where  MBD1  is  associated  in  the  normal  tissue  from 
Sample  #3.  Continuing  with  the  Partek  Genomics  Suite,  when  it  was  used  to  identify 
genes  within  10  kb  of  the  peaks,  those  42  peaks  yielded  a  set  of  41  Transcript  IDs 
(see  Table  9).  It  should  be  noted  that  many  of  the  Transcript  IDs  correspond  to  the 
same  gene  (symbol).  Based  on  these  results,  the  peaks  (Table  8)  and  gene  list 
(Table  9)  indicate  the  genomic  locations  where  MBD1  was  bound  in  normal  prostate 
tissue  from  sample  #3.  Each  ChIP-Seq  data  set  generates  similar  sized  arrays  of 
results,  whether  malignant,  normal  or  benign  or  an  immunoprecipitation  (ChIP)  of 
MBD1,  MBD2,  MBD4  or  MeCP2.  The  results  for  MBD1  are  shown  in  Tables  8-15 
including  a  list  of  normal  tissue-specific  peaks  and  malignant  specific  peaks  for 
Sample  (or  Patient)  #3  and  closest  genes  to  normal-specific  peaks  and  malignant- 
specific  peaks  for  all  three  Samples  (Patient  #3,  #4  and  #6).  This  set  of  tables  is 
listed  in  the  Supporting  Data  at  the  end,  but  it  is  also  listed  below  to  help  clarify  what 
is  present. 

Table  8:  Patient  3  Normal  MBD1  Peaks 
Table  9:  Patient  3  Normal  MBD1  Gene  List 
Table  10;  Patient  4  Normal  MBD1  Gene  List 
Table  11:  Patients  Normal  MBD1  Gene  List 
Table  12:  Patient  3  Malignant  MBD1  Peaks 
Table  13:  Patient  3  Malignant  MBD1  Gene  List 
Table  1410;  Patient  4  Malignant  MBD1  Gene  List 
Table  15:  Patient  6  Malignant  MBD1  Gene  List 

NCTE:  Tables  for  MBDI-specific  Normal  and  Malignant  Peaks  for  Samples  #4  and 
#6  and  all  of  the  MBD2,  MBD4  and  MeCP2  results  using  Genomics  Suite  (Partek) 
were  not  included  in  an  attempt  to  minimize  the  number  of  tables  in  this  document. 
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I  need  to  address  an  important  technical  point  regarding  the  analysis 
performed  on  Partek  Genomics  Suite.  One,  as  this  analysis  was  being  performed,  the 
small  number  of  common  sites  was  surprising.  I  followed  up  this  point  in  two  ways.  I 
used  the  “genome  browser”  function  which  allows  me  to  visually  compare  peaks  in 
different  samples  and  found  that  some  common  sites  seemed  to  be  present  that 
were  not  being  identified  by  the  software  package.  A  visual  inspection  is  one  method 
for  identifying  the  common  peaks.  While  the  visual  inspection  of  one  chromosome  for 
comparison  is  possible,  each  autosomal  and  sex  chromosome  is  presented 
separately  so  ChIP-Seq  of  each  tissue  would  require  24  visual  inspections  (22 
autosomal,  X  and  Y  chromosomes).  This  is  impractical.  That  is  part  of  the  reason  that 
RSEG/Galaxy  and  Bowtie/MACS  was/is  being  used  for  this  analysis.  Two,  I 
attempted  to  identify  common  peaks  in  three  related  sets  of  results  simultaneously. 
This  software  package  is  supposed  to  have  the  capability  to  perform  this  analysis,  but 
the  output  in  three  columns  was  not  making  sense.  After  discussions  with  customer 
support  at  Partek,  it  is  likely  there  is  a  “bug”. 

The  first  task  was  broken  down  into  four  parts  and  the  first  three  parts  (perform 
ChIPs,  prepare  samples  for  NGS  and  perform  NGS)  have  been  completed.  Two 
matched  malignant/normal  tissue  samples  and  one  matched  malignant/benign 
sample  was  used  in  our  ChIP-Seq  studies.  The  bioinformatic  analysis  is  the  fourth 
task  and  some  important  and  representative  analysis  has  been  presented  in  this 
report.  The  bioinformatics  analysis  will  be  continued  to  generate  additional 
information.  The  second  task  is  identifying  both  genes  that  are  differentially 
expressed  in  malignant  prostate  tissues  and  corresponding  gene  promoters  that 
differentially  bound  by  MBD  proteins  in  malignant  prostate  tissues.  Microarray 
experiment  has  been  performed  and  those  results  have  been  used  to  generate 
correlations  between  sites  of  MBD  family  association  and  differentially  regulated 
genes  in  malignant  tissues.  The  steps  involved  in  the  two  tasks  have  been  detailed 
above. 
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The  third  task  is  the  writing  and  submission  of  a  manuscript.  The  delay  involved 
in  preparing  samples  for  NGS  and  the  time-consuming  nature  of  the  data  analysis 
has  meant  that  this  task  has  not  been  completed.  The  results  described  above  will  be 
used  in  the  future  for  both  a  manuscript  and  in  grant  applications. 


Key  Research  Accomplishments 

-  Chromatin  Immunoprecipitation  using  matched  malignant  and  normal/benign 
prostate  tissues  were  performed  with  antibodies  directed  against  MeCP2, 
MBD1,  MBD2  and  MBD4  (methyl-CpG  binding  domain  containing  proteins) 

-  “Next  Generation  Sequencing  of  Chi P  experiments  described  was  performed. 

-  The  bioinformatics  analysis  of  NGS  data  has  been  undertaken.  Bioinformatic 
analysis  is  still  on-going.  Differences  in  the  MBD  association  profiles  between 
matched  tissues  were  observed.  Genes  overlapping  the  binding  sites  of  MBD 
proteins  were  identified.  The  microarray  analysis  was  used  to  determine  which 
of  the  overlapping  genes  were  up-regulated  or  down-regulated. 

-  mRNAs  has  been  isolated  from  each  tissue.  Small  RNAs  (miRNAs  being  of 
interest)  have  been  isolated  from  some  matched  tissues. 

-  The  mRNA  from  Stage  3  prostate  cancer  tissue  and  normal  tissue  was  pooled, 
respectively,  and  analyzed  using  Affymetrix  ST  Human  Gene  2.0  microarrays. 
The  analysis  identified  genes  that  are  up-regulated  and  down-regulated  in 
malignant  tissues  -  see  Supporting  Data. 
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Reportable  Outcomes 

1.  The  association  site  for  MBD1,  MBD2,  MBD4  and  MeCP2  in  matched  sets 
of  malignant  and  normal  or  benign  prostate  tissues 

2.  Using  information  in  point  #1  to  identify  the  common  association  site  for 
MBD1  and  MeCP2  among  three  matched  sets  of  tissues 

3.  Identification  of  genes  that  overlap  with  the  common  association  sites  in 
point  #2 

4.  Identification  of  genes  from  point  #3  that  are  up-regulated  or  down- 
regulated  by  microarray  analysis  of  prostate  tissues 

5.  Using  information  in  point  #1  to  identify  common  association  site  for  MBD1 
and  MeCP2  among  two  matched  sets  of  tissues  (that  is  a  pair-wise 
comparison  instead  of  the  three  way) 

6.  Using  information  in  point  #1  to  identify  common  association  site  for  MBD2 
and  MBD4  among  two  matched  sets  of  tissues 

7.  Microarray  analysis  of  malignant  and  normal  prostate  tissue  to  identify 
genes  that  are  consistently  up-regulated  or  down-regulated. 

8.  Additional  question  are  being  addressed  by  continuing  the  bioinformatics 
analysis 
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Conclusions 

This  study  has  identified  the  association  sites  of  MBD  family  members  in 
matched  malignant  and  normal  or  benign  prostate  tissues  through  ChIP-Seq 
experiments  and  bioinformatics  analysis,  then  used  these  results  to  identify  genes 
that  overlap  these  regions.  These  results  are  important  because  they  provide  a 
genome-wide  entry  to  understanding  one  type  of  epigenetic  regulation  that  is  often 
involved  in  cancer.  The  implication  is  that  some  of  the  sites  or  regions  of  MBD1, 
MBD2,  MBD4  and  MeCP2  association  that  are  specific  to  malignant  tissues  are 
important  to  tumor  progression.  From  a  technical  stand-point,  this  study  is  useful  in 
contributing  to  early,  but  growing,  literature  that  ChIP-Seq  can  be  used  on  clinical 
(patient)  samples. 

The  results  are  summarized  by  a  variety  of  tables  -  in  the  supporting  data  - 
that  list  the  malignant-specific  association  sites  for  these  MBD-family  members  and 
the  regions  containing  overlapping  genes  and  the  microarray  analysis.  This 
information  generates  two  types  of  scientific  hypotheses.  One,  that  the  identified 
genes  are  involved  in  tumor  progression  and  that  the  association  of  MBD  family 
members  regulates  their  expression  in  a  manner  that  leads  to  cancer.  Two,  that  the 
identified  genes  may  not  be  involved  in  tumor  progression  but  their  expression 
changes  may  be  useful  as  biomarkers.  In  the  future,  I  will  propose  studies  to 
characterize  those  hypotheses  for  selected  genes.  By  increasing  the  understanding 
of  gene  regulation  leading  to  malignant  cancers,  a  critical  mass  will  be  reached 
where  the  disease  can  be  addressed  in  a  differential  manner.  Cancer  is  such  a 
complicated  disease  that  ideas  of  “one  size  fits  all”  are  naive.  However,  a  more 
robust  understanding  of  the  basic  science  underlying  the  process  will  allow  clinicians 
to  determine  the  specific  pathways  have  been  corrupted,  how  the  pathways 
intertwine  and  the  stage  of  the  process  and  thus  tailor  treatment  accordingly. 
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Supporting  Data 

Table  1 .  Genes  that  are  up-regulated  in  Stage  3  prostate  tumors  (top  98) 

Table  2.  Genes  that  are  down-regulated  in  Stage  3  prostate  tumors  (top  98) 

Table  3.  Malignant-Specific  MBD1  Peaks  Common  to  All  Samples 

Table  4.  Gene-Containing  Regions  Overlapping  Malignant-Specific  MBD1  Peaks 

Table  5.  Malignant-Specific  MeCP2  Peaks  Common  to  All  Samples 

Table  6.  Gene-Containing  Regions  Overlapping  Malignant-Specific  MeCP2  Peaks 

Table  7.  Malignant-Specific  MBD4  Peaks  Common  to  the  Two  Patient  Samples 

Table  8.  MBD1 -Specific  Peaks  in  Patient  #3  Normal  Tissue 

Table  9.  Genes  Nearest  MBD1 -Specific  Peaks  in  Patient  #3  Normal  Tissue 

Table  10.  Genes  Nearest  MBD1 -Specific  Peaks  in  Patient  #4  Normal  Tissue 

Table  11.  Genes  Nearest  MBD1 -Specific  Peaks  in  Patient  #11  Normal  Tissue 

Table  12.  MBDI-Specific  Peaks  in  Patient  #3  Malignant  Tissue 

Table  13.  Genes  Nearest  MBDI-Specific  Peaks  in  Patient  #3  Malignant  Tissue 

Table  14.  Genes  Nearest  MBDI-Specific  Peaks  in  Patient  #4  Malignant  Tissue 

Table  15:  Genes  Nearest  MBDI-Specific  Peaks  in  Patient  #5  Malignant  Tissue 
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TABLE  1:  Genes  up-regulated  in  Stage  3  Prostate  Cancer 


Probe  ID 

Stage  3 

Normal 

16888999 

582.4666 

13.2632 

16797447 

101.4443 

2.81261 

16900148 

291.9534 

9.17729 

16734799 

559.6259 

17.6337 

16882744 

226.7969 

8.43171 

17092187 

1071.185 

42.6476 

16877453 

139.4926 

6.79325 

16927869 

253.0766 

15.1123 

16994426 

43.19071 

2.90542 

16906620 

934.7865 

64.4569 

17086067 

713.7975 

50.5516 

16797494 

113.5486 

8.53564 

16987773 

59.76891 

4.57236 

16734793 

736.5161 

61.2429 

16900152 

68.56734 

5.88263 

16994424 

45.59653 

3.9256 

16713395 

44.09912 

3.8074 

16870453 

400.378 

36.6231 

16721212 

70.11575 

6.46355 

16797415 

1435.142 

136.309 

16797583 

53.47449 

5.19404 

16882750 

203.5067 

20.8347 

16986655 

295.422 

30.5889 

16797512 

172.9989 

18.1594 

16797587 

41.84327 

4.42816 

16797389 

807.075 

85.6432 

16927853 

1388.054 

147.93 

16927834 

698.1865 

75.5693 

16818443 

479.6125 

51.9562 

16900098 

561.5751 

60.9755 

17095887 

141.7376 

15.7169 

16721210 

54.4311 

6.12543 

16900104 

189.546 

21.9098 

16923068 

253.2604 

29.5595 

16797371 

313.0038 

37.6674 

17000168 

664.37 

80.7914 

16797516 

768.5807 

96.3621 

16927858 

1583.615 

199.798 

16976644 

277.1924 

35.2967 

16797403 

431.224 

55.1977 

16905600 

215.332 

27.6004 

17011886 

30.68319 

3.94657 

17052737 

236.6025 

30.8979 

16797336 

1011.111 

137.856 

16709458 

62.47979 

8.59147 

16927785 

644.8411 

89.7019 

16771434 

345.3532 

48.3531 

16797479 

208.7004 

30.5188 

GeneAccession 


NM  006911 


_016192 

015342 


NM  030774 


Gene  Symbol  mRNA  Accessioi 
TCONSJ2_000: 
IGHV4-31  BC073766 
;iGKV2-24  ENST00000484 

ENST00000357 
aGKVlD-27  ENST00000453 
RLNl  NM_006911 

ENST00000421 
:IGLC7  // IGLENST00000390 


TMEFF2 

PCA3 


NM_016192 
NR  015342 


iIGHV3-23  //  ENST00000390 


OR51E2 


NM_030774 

AY510106 


NM_ 

NM 


.000095 

001004753 


ENST0000046269; 
NM  003248 


COMP 
OR51F2 
IGHAl 
IGHAl 
IGKV2D-24 
THBS4 
43814;  IGHV4-31 
//  AKC  IGHAl  // IGh 


NM  017680 


IGK@ 

ASPN 


ENST0000046416;iGKVl-6  // K 


AK125238 
NM_004887 
Y14737  //  Y1473 
BC030984 
NM_144646 

NM_133378 

NM_002652 

NM  198795 


IGHAl 
CXCL14 
IGHM  //  IGHI 
IGLVl-44 
IGJ 

TTN 

PIP 

TDRDl 


NM_000095 

NM_001004753 

BC092449 

AF067420 

ENST00000462 

NM_003248 

ENST00000438 

AK097017 

AY172958 

AY172962 

ENST00000385 

TCONSJ2_000: 

BC016380 

NM_017680 

ENST00000365 

ENST00000464 

TCONS_000290 

AK125238 

NM_004887 

Y14737 

BC030984 

NM_144646 

AY510104 

NM_133378 

TCONS_000119 

NM_002652 

BC065733 

NM_198795 

ENST00000385 

AK289373 


Fold  Increase 
43.91596001 
36.06769944 
31.81258351 
31.73610461 
26.89808428 
25.11712854 
20.53399462 
16.7463543 
14.8655804 
14.50250027 
14.12016504 
13.30288686 
13.07178569 
12.0261526 
11.65590277 
11.61516587 
11.58247319 
10.93240459 
10.8478683 
10.52856223 
10.29535187 
9.767671464 
9.657817051 
9.526706888 
9.449366734 
9.42368818 
9.383181234 
9.239019322 
9.231098058 
9.209851239 
9.018142208 
8.886080562 
8.651209027 
8.567817453 
8.309666999 
8.223274204 
7.975961306 
7.926100176 
7.853210073 
7.812360162 
7.801776642 
7.774657202 
7.657559252 
7.334534101 
7.272303279 
7.188714126 
7.142316182 
6.838423148 
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16900096 

903.503 

132.689 

16797690 

29.51886 

4.3362 

16983765 

307.1956 

46.5068 

17056833 

492.772 

75.8418 

16900132 

989.4561 

156.082 

16805899 

84.82558 

13.7971 

17049676 

245.1839 

40.3748 

16907831 

255.7697 

42.375 

16798951 

232.3069 

38.7016 

16797409 

411.3531 

68.8531 

16854408 

119.953 

20.2775 

16889743 

44.3531 

7.52407 

16900100 

43.68627 

7.48967 

16800630 

69.86362 

12.0546 

16650201 

15.91893 

2.76689 

16927810 

202.2847 

35.167 

16927756 

267.8531 

47.3531 

17122982 

1407.769 

249.319 

16882720 

1287.047 

229.739 

16797504 

101.5519 

18.2327 

17052973 

371.3268 

66.7751 

16988423 

120.4087 

21.8845 

17124830 

80.73445 

14.9903 

16771433 

260.3532 

48.8531 

16859795 

1084.581 

204.641 

16962689 

88.84472 

16.8925 

16771432 

246.8531 

47.3531 

16882723 

580.8939 

111.433 

16725041 

85.73141 

16.4861 

16826110 

27.76014 

5.36225 

16771448 

82.3531 

15.9189 

17041111 

572.3929 

111.749 

17031045 

542.9544 

106.942 

17041030 

576.8534 

113.619 

16908616 

17.87779 

3.52748 

17007118 

569.0654 

112.403 

17035749 

576.0376 

113.78 

16797353 

386.5006 

76.4972 

17068918 

42.02176 

8.31722 

17007048 

554.1838 

109.97 

17028176 

585.5333 

116.728 

16709245 

114.3195 

22.9462 

16927840 

137.7116 

27.8571 

17017217 

95.35309 

19.3674 

16797563 

130.4524 

26.5211 

16652211 

9.485942 

1.96101 

16797459 

25.60906 

5.2969 

16797603 

72.77382 

15.0733 

16778241 

574.0981 

120.903 

NM  000908 


NR_ 

NM_ 

NM_ 

NM 


.024387 

000602 

079420 

013372 


AK090461 
NR  024259 


NM  032413 


ENST0000039031( 


ENST0000049857^ 


NM_ 

NM 


NM_ 

NM 


.014141 

016644 


.004864 

001146686 


ENST0000046517( 


NM  001002029 


NM_007293 

NM_000681 

ENST0000039031f 

BC089417  //  BCC 


NM  006475 


IGK(g) 

BC032451 

NPR3 

NM_000908 

... 

AF085845 

:IGKV1-16 

ENST00000479 

CXADRP2 

NR_024387 

SERPINEl 

NM_000602 

MYLl 

NM_079420 

GREMl 

NM_013372 

IGHD 

AK090461 

LOC728606 

NR_024259 

iIGKVl-5  // 

I(ENST00000496 

C15orf48 

NM_032413 

:IGLV2-18  // 

:ENST00000390 

— 

BC073769 

IGKVl-39 

ENST00000498 

... 

AK303185 

CNTNAP2 

NM_014141 

PRR16 

NM_016644 

GDF15 

NM_004864 

GEMCl 

NM_001146686 

:iGKVl-37 

ENST00000465 

FAMlllB 

AK304258 

_ 

ENST00000450 

C4B 

NM_001002029 

:iGHG4  // IGhENST00000390 

... 

ENST00000520 

C4A 

NM_007293 

ADRA2A 

NM_000681 

iIGLV3-l 

ENST00000390 

:iGHGl  // IGhBC089417 

IGHM 

BC041037 

IGHGl 

AK127409 

POSTN 

NM_006475 

6.809198379 

6.807538025 

6.60539246 

6.497369457 

6.339335093 

6.148064234 

6.072691823 

6.035863127 

6.002520312 

5.974358453 

5.915574363 

5.894826631 

5.832872921 

5.795588745 

5.753365692 

5.752120313 

5.656502537 

5.646459213 

5.602204063 

5.569773615 

5.5608556 

5.502005757 

5.385786656 

5.329307659 

5.299922938 

5.259414971 

5.213028247 

5.212929169 

5.200220671 

5.176953572 

5.173281119 

5.122120785 

5.077082761 

5.07708123 

5.06815211 

5.062737817 

5.062736037 

5.052479692 

5.052382305 

5.039431842 

5.016241246 

4.982060234 

4.943508037 

4.923373416 

4.918817439 

4.83727365 

4.834722321 

4.828011212 

4.748415053 


TABLE  2:  Genes  down-regulated  in  Stage  3  Prostate  Cancer 


Probe  ID 

Stage  3 

Normal 

16940033 

11.826 

638.68 

17122144 

6.2387 

259.35 

16656659 

6.0239 

215.79 

16656655 

3.2542 

113.61 

16656633 

10.449 

318.04 

16656621 

5.4577 

138.86 

16940012 

250.17 

5124 

16656649 

12.12 

243.19 

16656663 

12.839 

242.21 

16656629 

29.143 

521.81 

16838249 

22.82 

391.11 

17018396 

5.6622 

94.853 

16656641 

40.009 

669.33 

16946280 

24.948 

415.38 

16656643 

20.828 

333.68 

16775083 

42.011 

659.24 

16656665 

14.388 

225.43 

16873204 

13.385 

196.08 

16656647 

49.361 

712.8 

16656657 

33.037 

466.29 

16744602 

16.991 

236.93 

16656623 

25.087 

346.64 

16852849 

40.26 

550.43 

16877318 

8.6936 

114.85 

16887010 

10.775 

140.67 

16656631 

55.242 

719.1 

16656645 

21.43 

269.83 

16656625 

21.654 

271.14 

16656627 

21.093 

243.64 

16781423 

2.8489 

30.936 

16869919 

22.381 

241.27 

17095327 

3.5275 

37.353 

16656637 

100.16 

1030.9 

16656653 

5.0109 

48.755 

16656365 

4.0486 

38.353 

16681986 

5.2349 

47.836 

16688386 

8.3121 

73.184 

17020069 

30.737 

270.55 

16656661 

80.853 

708.35 

16873190 

34.984 

303.99 

16990124 

4.8672 

41.853 

16809872 

17.459 

146.24 

17118860 

270.37 

2236.1 

17009219 

30.692 

251.43 

16886806 

8.3794 

68.485 

16656609 

13.688 

111.69 

16938645 

9.5682 

73.627 

16821186 

11.575 

89.039 

GeneAccession  Gene  Symbol  mRNA  Accession  Decreased  Exp 

ENST000004272! 


NM_003241  TGM4 


NM_022131  CLSTN2 
NM_006418  OLFM4 

NM_00107763SFAM55D 
NM_080475  SERPINBll 


NM_00111412CDEPDC1 
NM_145740  GSTAl 

NM_001 102597  CEACAM20 

NM_00101792CDAPL1 

NM_005752  CLEC3A 


0.018516648 
0.024054931 
0.027915161 
0.028642713 
0.032854283 
0.039303895 
NM_003241  0.048823623 

0.049838728 
0.053008222 
0.055849803 
TCONS_0002513  0.058346925 
0.059694448 
0.059774216 
NM_022131  0.060061471 

0.06241899 
NM_006418  0.063725796 

0.063825701 
TCONSJ2_OOOi;  0.068261563 
0.06924903 
0.070849745 
NM_001077639  0.071714189 

0.072372787 
NM_080475  0.07314271 

TCONS_0000358  0.075693159 
ENST000004189(  0.076600668 
0.076821157 
0.079420414 
0.079861297 
0.086574312 
0.092088406 
TCONS_0002720  0.092762895 
0.094435991 
0.097158633 
0.102777474 
0.105560776 
AK055853  0.109432777 

NM_001114120  0.113578084 

NM_145740  0.113609429 

0.114142303 
NM_001102597  0.11508526 

0.11629143 
0.119387685 
0.120908221 
0.122070117 
NM_001017920  0.122354193 

0.122545255 
0.129953918 
NM_005752  0.130000478 


TABLE  2:  Genes  down-regulated  in  Stage  3  Prostate  Cancer 


16781648 

13.8 

104.2 

NM 

17009989 

3.5174 

25.379 

... 

16888229 

32.909 

237.35 

... 

16656639 

64.075 

446.44 

... 

16844735 

23.823 

165.57 

NM 

16977810 

11.475 

78.582 

NM 

16771533 

4.4345 

30.353 

16789823 

6.9965 

47.319 

... 

17020059 

11.515 

77.374 

NM 

16650169 

5.0524 

33.713 

... 

17020258 

31.145 

204.57 

NM 

17084374 

9.1675 

59.874 

... 

16790138 

14.381 

93.802 

... 

16735833 

35.353 

227.85 

17083845 

5.7656 

36.864 

... 

16673840 

70.694 

419.46 

— 

16968307 

136.15 

806.33 

NR 

16656603 

4.4345 

25.853 

... 

16771531 

4.8672 

28.353 

16744585 

5.5649 

32.339 

NM 

16745561 

162.85 

932.45 

... 

16656607 

43.983 

251.08 

... 

16698502 

37.838 

211.67 

... 

16654487 

6.8124 

37.411 

— 

16656247 

2.8242 

15.495 

... 

16654391 

4.1713 

22.781 

... 

16805995 

10.55 

57.4 

... 

16906089 

13.132 

71.233 

... 

16656015 

13.046 

69.471 

... 

16652875 

13.203 

70.054 

... 

16656289 

11.398 

59.602 

... 

16655523 

2.7914 

14.564 

... 

16653061 

11.394 

58.497 

... 

16656619 

21.373 

108.99 

... 

16656651 

6.8494 

34.853 

... 

16859166 

10.199 

51.53 

NM 

16656153 

6.9357 

34.667 

— 

16651031 

3.7924 

18.865 

... 

16751999 

12.581 

62.353 

17068615 

9.4859 

46.353 

16771468 

4.0486 

19.367 

16651193 

9.1221 

43.522 

... 

16693526 

10.319 

48.853 

17092748 

8.3227 

39.388 

17011664 

32 

151.31 

... 

17075920 

17.939 

83.887 

... 

16835637 

7.5241 

34.853 

16790255 

6.2286 

28.353 

16993425 

20.361 

91.853 

014579  SLC39A2 


.153490 

178135 


.000846 

021073 


KRT13 

HSD17B13 


GSTA2 

BMP5 


026555  GDEP 


152315 


FAM55A 


NM  014579 


NM_153490 

NM_178135 

BC015119 

NM_000846 

NM_021073 

AK026285 

ENST000005171! 

NR  026555 


NM  152315 


023944  CYP4F12 


NM  023944 


0.132431439 

0.138599132 

0.138650689 

0.143524513 

0.143881982 

0.146031581 

0.146096891 

0.147857638 

0.148820521 

0.149864505 

0.152249232 

0.1531148 

0.153312867 

0.155157492 

0.156402214 

0.168534858 

0.168856087 

0.1715249 

0.171661985 

0.172078762 

0.17464698 

0.175174535 

0.178762811 

0.182095968 

0.182260341 

0.183103006 

0.183794102 

0.184349488 

0.187792919 

0.188474016 

0.191227413 

0.191664653 

0.194775024 

0.196096012 

0.196521314 

0.197931852 

0.200064557 

0.201031031 

0.201766071 

0.204645256 

0.209040745 

0.20959556 

0.211223443 

0.211301023 

0.211485799 

0.213850672 

0.215879563 

0.219679295 

0.221670726 


TABLE  3 


MALIGNANT-SPECIFIC  MBDl  PEAKS 
COMMON  TO  ALL  SAMPLES 


chromosome 

start 

stop 

chrl 

12523050 

14268300 

chrl 

11612S050 

116608500 

chr2 

0 

758700 

chr2 

6571500 

6805800 

chr2 

91S31350 

91876950 

chr2 

22SS59650 

229637550 

chr3 

60972750 

61300200 

chr3 

61425150 

61445850 

chr3 

90418200 

90427350 

chr3 

196933800 

197440200 

chr3 

197896650 

198295500 

chrS 

0 

56250 

chrS 

56550 

400650 

chrS 

79544250 

80367900 

chr6 

73826250 

74213850 

chr6 

74214000 

74715900 

chr6 

160291200 

160698450 

chr6 

161017050 

161046750 

chr7 

149883750 

152072250 

chr7 

158249250 

158423700 

chr8 

98590350 

99118500 

chrlO 

37540500 

37678800 

chrll 

51373350 

51568050 

chrl2 

7459200 

8020500 

chrl2 

8021700 

8246850 

chrl4 

72603750 

73275750 

chrl4 

73275900 

73314900 

chrl4 

88099800 

88284750 

chrl7 

21554250 

21902400 

Chris 

4336500 

5065350 

Chris 

7039200 

7672950 

chrX 

138940350 

142020000 

TABLE  4 


GENE-CONTAINING  REGIONS 
OVERLAPPING  MALIGNANT- 
SPECIFIC  MBDl  PEAKS 


chromosome  start 

stop  decreased 

increased 

chrl 

12523050 

14268300 

chrl 

116128050 

116608500  SLC22A15 

CASQ2 

chr2 

0 

408600  FAMllOC 

chr2 

91831350 

91832550 

chr2 

228893250 

229140900 

chr3 

60972750 

61300200 

chr3 

196933800 

197425800 

chr3 

197426250 

197440200 

KIAA0226 

chr3 

197896650 

197929950 

chr3 

197931000 

198295500 

chrS 

56550 

400650  PLEKHG4B,  PDCD6 

CCDC127 

chrS 

79947900 

79999800 

DHFR 

chr6 

74082600 

74213850 

chr6 

74214000 

74517000 

chr6 

160291200 

160698450 

chr6 

161017050 

161046750 

chr7 

150249900 

150852300 

chr7 

158249250 

158322450 

PTPRN2 

chr7 

158323800 

158423700 

PTPRN2 

chr8 

98606250 

98729250 

chr8 

98790900 

99118500  HRSP12 

SNORNA72 

chrll 

51374400 

51471450 

chrll 

51471750 

51567300 

chrl2 

7459200 

8020500 

chrl2 

8021700 

8023200 

chrl4 

72966600 

73275750 

DPF3 

chrl4 

73275900 

73314900 

DPF3 

chrl7 

21554250 

21902400 

chrl8 

7039200 

7224750 

chrX 

138940350 

140755650 

chrX 

140755950 

140875800 

TABLE  5 


MALIGNANT-SPECIFIC  MeCP2  PEAKS 
COMMON  TO  ALL  SAMPLES 


chromosome 

chrl 

start 

212202300 

stop 

213467700 

chrl 

214479900 

214782600 

chrl 

222022200 

222201450 

chr2 

89222850 

89830350 

chr2 

111501750 

113411100 

chr2 

133046400 

133685700 

chr2 

158956800 

159025200 

chr2 

228379500 

228858900 

chr3 

59997150 

60304200 

chr3 

75700650 

75715950 

chr3 

140437800 

140608050 

chr4 

44100 

59250 

chr4 

49521150 

49527900 

chr4 

111155100 

111810300 

chr4 

185137350 

186126900 

chr5 

151869450 

151945800 

chr5 

153061200 

154112250 

chr7 

152664300 

153193650 

chrS 

43097100 

43235700 

chr9 

66837000 

66840000 

chr9 

126298350 

130003950 

chrlO 

106217550 

106511400 

chrlO 

129684750 

130794900 

chrlO 

131132550 

132382200 

chrll 

47570250 

48229800 

chrll 

123208500 

124324350 

chrll 

128769000 

129463800 

chrll 

132696150 

133383600 

Chris 

93206700 

93763500 

chrlS 

65125350 

65733300 

Chris 

7039200 

7672950 

chr20 

17567250 

19164000 

chr20 

23884350 

24613500 

chr20 

24613950 

25904550 

chr20 

40813950 

41133600 

chr20 

44077350 

45155700 

chr22 

20658000 

20689050 

chrY 

9992550 

10017600 

TABLE  6 


GENE-CONTAINING  REGIONS 
OVERLAPPING  MALIGNANT- 
SPECIFIC  MeCP2  PEAKS 


chromosome 

start 

end  decreased 

increased 

chrl 

213434550 

213467700 

RPS6KC1 

chrl 

214479900 

214782600 

chr2 

89222850 

89830350 

chr2 

111501750 

112198050  ACOXL 

chr2 

133046400 

133685700 

LYPDl,  NCKAP5 

chr2 

158990400 

159025200 

chr2 

228379500 

228858900 

chr3 

59997150 

60107100 

chr3 

75702900 

75715650 

chr4 

111155100 

111810300 

chr4 

185639100 

186126900 

chrS 

153061200 

153146700  GRIAl 

chr8 

43097400 

43235700 

chr9 

127475250 

128050800 

chrlO 

106217550 

106511400  SORCS3 

chrll 

47834250 

48229800 

chrll 

123208500 

124041300 

chrll 

124041450 

124324350 

chrll 

128769000 

129178350  Cllorf45 

chrll 

133287300 

133383600 

chrlS 

93732150 

93763500 

chrl6 

65125350 

65692350 

CDHll 

Chris 

7224900 

7672950 

PTPRM 

chr20 

17871000 

18138750 

chr20 

24493950 

24613500 

chr20 

24613950 

25222500 

chr20 

25223250 

25423950  GINSl 

chr20 

40958250 

41133600 

PTPRT 

chr20 

44077350 

44654700 

TABLE  7 


MALIGNANT-SPECIFIC  MBD4  PEAKS 
COMMON  TO  THE  TWO  PATIENT  SAMPLES 


Patient  3 


chrl 

48451650 

chrl 

48451650 

chrl 

53847900 

chrl 

54833400 

chrl 

90298800 

chrl 

155525550 

chrl 

155858700 

chrl 

160212600 

chrl 

227740800 

chrl 

227740800 

chrl 

228058500 

chrl 

228058500 

chrl 

228423750 

chrl 

228612600 

chrl 

228870750 

chrl 

231021300 

chrl 

231021300 

chrl 

244950600 

chrlO 

47668350 

chrlO 

73386900 

chrlO 

119401950 

chrlO 

120360600 

chrlO 

132578400 

chrll 

32156700 

chrll 

43654050 

chrll 

48868650 

chrll 

71920200 

chrll 

71920200 

chrll 

72776400 

chrll 

73114800 

chrll 

78640500 

chrll 

126216600 

chrll 

126216600 

chrll 

134626650 

chrl2 

1986900 

chrl2 

1986900 

chrl2 

1986900 

chrl2 

1986900 

chrl2 

30933150 

chrl2 

61923150 

chrl2 

64015650 

chrl2 

64860450 

chrl3 

24174000 

48705750  chrl 

48705750  chrl 

54832200  chrl 

54921300  chrl 

90887550  chrl 

155858550  chrl 

156243150  chrl 

160679550  chrl 

228058350  chrl 

228058350  chrl 

228422700  chrl 

228422700  chrl 

228610050  chrl 

228870600  chrl 

229166700  chrl 

231344700  chrl 

231344700  chrl 

244960350  chrl 

49775550  chrlO 

73392900  chrlO 

119546550  chrlO 

120952350  chrlO 

133241850  chrlO 

32287950  chrll 

44115450  chrll 

48876600  chrll 

72774300  chrll 

72774300  chrll 

73114200  chrll 

73189050  chrll 

79104900  chrll 

126509400  chrll 

126509400  chrll 

135086550  chrll 

2656500  chrl2 

2656500  chrl2 

2656500  chrl2 

2656500  chrl2 

31372800  chrl2 

62091450  chrl2 

64087500  chrl2 

65359200  chrl2 

24556800  chrl3 


Patient  6 

47541600 

48588300 

48685350 

48703350 

54372150 

55172400 

54372150 

55172400 

90763800 

90771300 

154907700 

156195150 

154907700 

156195150 

160212600 

160447500 

227613900 

228042900 

228043350 

228110700 

228216150 

229731900 

228043350 

228110700 

228216150 

229731900 

228216150 

229731900 

228216150 

229731900 

231122100 

231463650 

230768100 

231121950 

244468200 

245469900 

47629050 

47941500 

72137850 

73530600 

119192850 

120390750 

119192850 

120390750 

133174200 

133264950 

31635000 

32223750 

43877700 

44000550 

48868800 

48877500 

72122250 

72505050 

72505200 

73249050 

72505200 

73249050 

72505200 

73249050 

78370500 

79122000 

126349950 

126376350 

126092850 

126349500 

134589900 

135086550 

2205750 

2294250 

1696800 

2205600 

2294400 

2343900 

2344050 

2356050 

31322400 

31562700 

61856550 

61927650 

63922350 

64744350 

64878750 

65446350 

23889900 

24759750 

chrl3 

74046750 

chrl4 

73446600 

chrl4 

73522200 

chrl4 

78194250 

chrl4 

78312750 

chrl4 

100137000 

chrl5 

28277100 

chrl5 

72331200 

chrl5 

72331200 

chrl5 

93207600 

chrl5 

93747600 

chrl6 

27456300 

chrl6 

27688050 

chrl6 

54940800 

chrl7 

26460300 

chrl7 

26695050 

chrl7 

26695050 

chrl7 

27084900 

chrl7 

27084900 

chrl7 

27084900 

chr2 

9548700 

chr2 

9586800 

chr2 

9586800 

chr2 

10096800 

chr2 

26814300 

chr2 

26814300 

chr2 

42857400 

chr2 

72019200 

chr2 

72019200 

chr2 

72019200 

chr2 

109627200 

chr2 

109678950 

chr2 

120928800 

chr2 

124640550 

chr2 

127222800 

chr2 

128157450 

chr2 

128970000 

chr2 

128990100 

chr2 

130040550 

chr2 

132988800 

chr2 

178087950 

chr2 

178087950 

chr2 

225591600 

chr2 

240399750 

chr20 

7518750 

chr20 

24716550 

chr20 

25174950 

chr20 

29652450 

chr21 

9703500 

chr21 

10766850 

chr22 

27620250 

chr3 

13620300 

74453700  chrl3 

73521900  chrl4 

73546650  chrl4 

78312600  chrl4 

78394050  chrl4 

100677750  chrl4 

28783650  chrl5 

73209600  chrl5 

73209600  chrl5 

93427800  chrl5 

94232400  chrl5 

27687900  chrl6 

28160550  chrl6 

55819350  chrl6 

26694900  chrl7 

27083550  chrl7 

27083550  chrl7 

27775950  chrl7 

27775950  chrl7 

27775950  chrl7 

9585750  chr2 

10096650  chr2 

10096650  chr2 

11310000  chr2 

27428550  chr2 

27428550  chr2 

42873750  chr2 

72273000  chr2 

72273000  chr2 

72273000  chr2 

109678050  chr2 

109815750  chr2 

121146150  chr2 

124660350  chr2 

128156850  chr2 

128969100  chr2 

128989950  chr2 

129090000  chr2 

130963500  chr2 

132994950  chr2 

178348200  chr2 

178348200  chr2 

225905850  chr2 

240759600  chr2 

7867200  chr20 

25172100  chr20 

25584150  chr20 

29804100  chr20 

9710550  chr21 

10770450  chr21 

28537200  chr22 

13929300  chr3 


73883700 

74180850 

73098450 

74237700 

73098450 

74237700 

77162850 

78426150 

77162850 

78426150 

99734550 

100515900 

28388550 

28462050 

72578850 

72604050 

73065450 

73180350 

93285750 

93696600 

93699600 

93843300 

26827500 

27614700 

28032300 

28137000 

55255050 

55279500 

26480550 

26912250 

26480550 

26912250 

26912850 

27257100 

26912850 

27257100 

27257400 

27544350 

27544500 

28207350 

9362250 

9869550 

9362250 

9869550 

9869700 

10677000 

9869700 

10677000 

26421600 

27201750 

27201900 

27342900 

42378000 

43356000 

72181500 

72191700 

71745300 

72181350 

72192000 

72203850 

109185900 

109815750 

109185900 

109815750 

120649200 

121886550 

124621950 

124650000 

127952100 

129696300 

127952100 

129696300 

127952100 

129696300 

127952100 

129696300 

129964200 

130171950 

132988050 

132995850 

178223400 

178562100 

178053300 

178223250 

225636450 

225993000 

240016650 

240600450 

7718550 

7800750 

25063500 

25269750 

25063500 

25269750 

29652000 

29804400 

9703200 

9712050 

10765050 

10772850 

27768150 

28350300 

13477200 

13768800 

chr3 

13929600 

14577900 

chr3 

32480700 

33645900 

chr3 

54449850 

54937350 

chr3 

90475350 

90486000 

chr3 

150364050 

150446550 

chr3 

185415450 

186051450 

chr4 

12287100 

12671400 

chr4 

12671550 

13084050 

chr4 

12671550 

13084050 

chr4 

53306250 

53427600 

chr4 

53428200 

53449650 

chr4 

68266650 

68326500 

chr4 

166484100 

166505850 

chr4 

185763000 

186126900 

chr5 

7547400 

7684350 

chr5 

7547400 

7684350 

chr5 

7843200 

7954950 

chr5 

17530500 

17580600 

chr5 

149702850 

150479100 

chr5 

149702850 

150479100 

chr5 

150795000 

150810750 

chr5 

170943900 

171013650 

chr5 

171113400 

171366150 

chr5 

171113400 

171366150 

chr6 

1698000 

1851750 

chr6 

2897550 

3328650 

chr6 

3328800 

3411900 

chr6 

121773900 

121803900 

chr7 

6247650 

6435000 

chr7 

28077150 

28401300 

chr7 

58022100 

58028550 

chr7 

128146200 

128715900 

chr7 

135332250 

135603150 

chr7 

154494150 

155272500 

chr8 

0 

758100 

chr8 

758400 

791550 

chr8 

20871600 

20932800 

chr8 

40601550 

41733000 

chr8 

86576400 

86726550 

chr8 

101609700 

101965950 

chr8 

101609700 

101965950 

chr8 

124796850 

124904100 

chr8 

143829600 

144751500 

chr9 

86948700 

87136200 

chr9 

99577950 

99771450 

chrX 

14254650 

15004200 

chrX 

111008250 

111412950 

chrY 

9966450 

9972000 

chr3 

13961250 

14026650 

chr3 

33311250 

33322650 

chr3 

54366000 

54661500 

chr3 

90478200 

90486300 

chr3 

150158550 

150643350 

chr3 

185187000 

185874750 

chr4 

12613800 

12641850 

chr4 

12816600 

12863250 

chr4 

12960900 

13053300 

chr4 

53088600 

53469150 

chr4 

53088600 

53469150 

chr4 

68266950 

68311950 

chr4 

166327500 

166925550 

chr4 

185818200 

185949600 

chr5 

7538700 

7667550 

chr5 

7667700 

7868550 

chr5 

7667700 

7868550 

chr5 

17526600 

17584650 

chr5 

150476100 

150828300 

chr5 

150170100 

150256800 

chr5 

150476100 

150828300 

chr5 

170553900 

171279150 

chr5 

170553900 

171279150 

chr5 

171279450 

171646950 

chr6 

1765800 

3367650 

chr6 

1765800 

3367650 

chr6 

1765800 

3367650 

chr6 

121771500 

121842150 

chr7 

6346350 

6384300 

chr7 

28136100 

28167150 

chr7 

58023000 

58032750 

chr7 

128673300 

128678400 

chr7 

134878200 

135483450 

chr7 

154607100 

154612950 

chr8 

482250 

837600 

chr8 

482250 

837600 

chr8 

20748900 

20976150 

chr8 

41583150 

42248400 

chr8 

86574300 

86728050 

chr8 

101818800 

101932800 

chr8 

101932950 

101940600 

chr8 

124601100 

125125650 

chr8 

143756250 

144110700 

chr9 

87029850 

87072750 

chr9 

99407850 

100405650 

chrX 

14191950 

14296650 

chrX 

111210900 

111227550 

chrY 

9959550 

9973350 

TABLE  8:  MBDl-SPECIFIC  PEAKS  IN 
SAMPLE  #3  NORMAL  TISSUE 


Chromos 

Location 

Start 

Stop 

Interval 

Length 

Maximum 
Extended 
Reads  in 
Window 

Reads  Per 
Million 
(RPM) 

Mann- 

Whitney 

p-value 

015_Input: 
total  reads 
in  region 

012:  total 
reads  in 
region 

p-value 
(Sample  ID 
vs. 

015_Input) 

scaled  fold 
change 

(Sample  ID  vs. 
015_Input) 

1 

143280363 

143280632 

269 

29 

24.6 

0.000949 

7 

1.15E-05 

4.5 

1 

152071255 

152071455 

200 

28 

23.8 

2.22E-07 

0 

1.59E-10 

29.0 

2 

89855492 

89855725 

233 

26 

22.1 

0.007076 

5 

30 

6.12E-07 

6.2 

2 

105451119 

105451403 

284 

42 

35.7 

0.018127 

0 

42 

2.01E-15 

43.0 

2 

116630336 

116630628 

292 

37 

31.4 

0.00866 

0 

37 

1.13E-13 

38.0 

2 

124311717 

124311917 

200 

25 

21.2 

8.85E-05 

0 

25 

1.79E-09 

26.0 

2 

171046320 

171046523 

203 

43 

36.5 

4.68E-06 

0 

42 

2.01E-15 

43.0 

3 

132593896 

132594115 

219 

29 

24.6 

0.019775 

0 

28 

1.59E-10 

29.0 

4 

49099603 

49099937 

334 

59 

50.1 

0.000247 

29 

65 

1.44E-06 

2.7 

4 

78671563 

78671765 

202 

44 

37.4 

0.01491 

0 

44 

4.02E-16 

45.0 

5 

90368423 

90368623 

200 

26 

22.1 

3.16E-05 

0 

26 

7.98E-10 

27.0 

5 

129033693 

129033894 

201 

59 

50.1 

6.18E-08 

0 

59 

2.26E-21 

60.0 

6 

57420994 

57421194 

200 

26 

22.1 

0.005279 

0 

26 

7.98E-10 

27.0 

6 

63742069 

63742269 

200 

35 

29.7 

1.81E-08 

0 

33 

2.84E-12 

34.0 

7 

81484848 

81485147 

299 

29 

24.6 

0.020106 

0 

29 

7.12E-11 

30.0 

7 

100243496 

100243697 

201 

31 

26.3 

0.0009 

0 

32 

6.35E-12 

33.0 

7 

133793925 

133794126 

201 

31 

26.3 

0.005249 

0 

29 

7.12E-11 

30.0 

8 

42274229 

42274429 

200 

33 

28.0 

l.OOE-04 

0 

37 

1.13E-13 

38.0 

8 

60439838 

60440045 

207 

35 

29.7 

1.28E-07 

0 

35 

5.66E-13 

36.0 

8 

67782070 

67782464 

394 

31 

26.3 

0.00302 

2 

31 

2.42E-09 

12.2 

10 

8536156 

8536492 

336 

35 

29.7 

6.73E-09 

0 

35 

5.66E-13 

36.0 

10 

19049775 

19049975 

200 

34 

28.9 

0.015793 

0 

34 

1.27E-12 

35.0 

11 

36507360 

36507561 

201 

27 

22.9 

0.000911 

0 

22 

2.00E-08 

23.0 

11 

48961677 

48962064 

387 

42 

35.7 

0.000786 

1 

43 

2.23E-14 

24.3 

12 

34841440 

34841733 

293 

36 

30.6 

0.019228 

24 

36 

0.0121762 

1.8 

12 

66148413 

66148613 

200 

39 

33.1 

9.48E-10 

0 

39 

2.26E-14 

40.0 

12 

73995168 

73995418 

250 

34 

28.9 

0.004634 

0 

34 

1.27E-12 

35.0 

13 

85554050 

85554250 

200 

35 

29.7 

6.63E-09 

0 

35 

5.66E-13 

36.0 

13 

96096292 

96096492 

200 

27 

22.9 

3.81E-07 

0 

27 

3.57E-10 

28.0 

14 

88379624 

88379824 

200 

38 

32.3 

0.01164 

2 

39 

5.90E-12 

15.3 

16 

22367793 

22367993 

200 

35 

29.7 

6.63E-09 

1 

35 

1.15E-11 

19.9 

16 

33963881 

33964262 

381 

58 

49.3 

0.000724 

19 

70 

6.01E-11 

4.3 

16 

46400199 

46400519 

320 

33 

28.0 

0.004751 

29 

42 

0.00996132 

1.8 

18 

50002111 

50002311 

200 

28 

23.8 

4.30E-07 

0 

27 

3.57E-10 

28.0 

19 

9710522 

9710725 

203 

27 

22.9 

0.004014 

20 

27 

0.0536516 

1.6 

19 

37333743 

37333943 

200 

26 

22.1 

3.61E-07 

0 

27 

3.57E-10 

28.0 

20 

11412592 

11412792 

200 

26 

22.1 

0.020913 

0 

26 

7.98E-10 

27.0 

21 

10856733 

10857069 

336 

41 

34.8 

0.003161 

9 

43 

1.91E-08 

5.3 

22 

18568833 

18569033 

200 

25 

21.2 

0.018258 

0 

25 

1.79E-09 

26.0 

Y 

13808692 

13808892 

200 

25 

21.2 

0.006342 

21 

25 

0.121031 

1.4 

Y 

58853928 

58854202 

274 

29 

24.6 

0.020345 

25 

32 

0.0543748 

1.6 

Y 

58886170 

58886424 

254 

28 

23.8 

0.008838 

18 

28 

0.0199038 

1.9 

TABLE  9:  GENES  NEAREST  MBDl-SPECIFIC  PEAKS  IN 
SAMPLE  #3  NORMAL  TISSUE 


Chromos 

Location 

Transcript 

Start 

Transcript 

Stop 

Strand 

Transcript  ID 

Gene 

Symbol 

Distance 
to  TSS 

Percent 
overlap 
with  gene 

Start 

Stop 

1 

152056620 

152061541 

- 

NM_001008536 

TCHHLl 

-9714 

152071255 

152071455 

2 

105421883 

105467935 

- 

NR_037883 

AC018730.1 

16532 

BBS 

105451119 

105451403 

2 

171034655 

171511675 

+ 

NM_001083615 

MY03B 

11665 

mSSm 

171046320 

171046523 

2 

171034655 

171511675 

+ 

NM_138995 

MY03B 

11665 

171046320 

171046523 

2 

171034655 

171511675 

+ 

NR_045682 

MY03B 

11665 

0.043 

171046320 

171046523 

2 

171034655 

171511675 

+ 

NR_045683 

MY03B 

11665 

0.043 

171046320 

171046523 

2 

171034655 

171511675 

+ 

NR_045684 

MY03B 

11665 

0.043 

171046320 

171046523 

4 

78634541 

78740545 

- 

NM_001286790 

CNOT6L 

68780 

0.192 

78671563 

78671765 

4 

78634541 

78740545 

- 

NM_144571 

CNOT6L 

68780 

0.192 

78671563 

78671765 

5 

89854617 

90460034 

+ 

NM_032119 

GPR98 

513806 

0.033 

90368423 

90368623 

5 

89854617 

90460034 

+ 

NR_003149 

GPR98 

513806 

0.033 

90368423 

90368623 

5 

128796103 

129074377 

+ 

NM_133638 

ADAMTS19 

237590 

0.073 

129033693 

129033894 

6 

57182415 

57513377 

+ 

NM_000947 

PRIM  2 

238579 

0.061 

57420994 

57421194 

7 

100218039 

100239174 

- 

NM_003227 

TFR2 

-4322 

0.000 

100243496 

100243697 

7 

100240726 

100254085 

- 

NM_016188 

ACTL6B 

10388 

1.512 

100243496 

100243697 

8 

42273980 

42358979 

- 

NM_001257180 

SLC20A2 

84550 

0.236 

42274229 

42274429 

8 

42273980 

42396656 

- 

NM_001257181 

SLC20A2 

122227 

0.164 

42274229 

42274429 

8 

42273980 

42397357 

- 

NM_006749 

SLC20A2 

122928 

0.163 

42274229 

42274429 

8 

67782984 

67814015 

+ 

NM_001136161 

MCMDC2 

-520 

0.000 

67782070 

67782464 

8 

67782984 

67834284 

+ 

NM_173518 

MCMDC2 

-520 

0.000 

67782070 

67782464 

8 

67783737 

67817600 

+ 

NM_001136160 

MCMDC2 

-1273 

0.000 

67782070 

67782464 

11 

36505317 

36531864 

- 

NM_004620 

TRAF6 

24303 

0.761 

36507360 

36507561 

11 

36505317 

36531864 

- 

NM_145803 

TRAF6 

24303 

0.761 

36507360 

36507561 

13 

96085853 

96232011 

+ 

NM_001160100 

CLDNIO 

10439 

0.138 

96096292 

96096492 

13 

96085853 

96232011 

+ 

NM_182848 

CLDNIO 

10439 

0.138 

96096292 

96096492 

16 

22357257 

22385939 

- 

NM_001802 

CDR2 

17946 

0.701 

22367793 

22367993 

16 

33961052 

33962504 

- 

NR_038368 

LINC00273 

-1377 

0.000 

33963881 

33964262 

18 

49866542 

51062274 

+ 

NM_005215 

DCC 

135569 

0.017 

50002111 

50002311 

19 

37309224 

37341690 

- 

NM_001242800 

ZNF790 

7747 

0.619 

37333743 

37333943 

19 

37309224 

37341170 

- 

NM_001242801 

ZNF790 

7227 

0.629 

37333743 

37333943 

19 

37309224 

37328930 

- 

NM_001242802 

ZNF790 

-4813 

0.000 

37333743 

37333943 

19 

37309224 

37329287 

- 

NM_206894 

ZNF790 

-4456 

0.000 

37333743 

37333943 

19 

37341260 

37370478 

+ 

NM_003419 

ZNF345 

-7317 

0.000 

37333743 

37333943 

19 

37341778 

37370478 

+ 

NM_001242472 

ZNF345 

-7835 

0.000 

37333743 

37333943 

19 

37341778 

37370478 

+ 

NM_001242474 

ZNF345 

-7835 

0.000 

37333743 

37333943 

19 

37341788 

37384121 

+ 

NR_038362 

ZNF345 

-7845 

0.000 

37333743 

37333943 

19 

37342553 

37370478 

+ 

NM_001242475 

ZNF345 

-8610 

0.000 

37333743 

37333943 

19 

37342553 

37370478 

+ 

NM_001242476 

ZNF345 

-8610 

0.000 

37333743 

37333943 

22 

18560686 

18573798 

+ 

NM_001199319 

PEX26 

8147 

1.533 

18568833 

18569033 

22 

18560686 

18573798 

+ 

NM_017929 

PEX26 

8147 

1.533 

18568833 

18569033 

22 

18560760 

18573798 

+ 

NM  001127649 

PEX26 

8073 

1.542 

18568833 

18569033 

TABLE  10:  GENES  NEAREST  MBDl-SPECIFIC 
PEAKS  IN  SAMPLE  #4  NORMAL  TISSUE 


chromos 

transcript 

transcript 

strand 

Transcript  ID 

Gene 

Distance 

Percent 

Start 

Stop 

location 

start 

stop 

Symbol 

to  TSS 

overlap 
with  gene 

1 

1 

169101768 

169101768 

169337202 

169337202 

- 

NM_013330 
NM  197972 

221772 

221772 

0.0854 

0.0854 

169115230 

169115230 

169115430 

169115430 

1 

169101768 

169337202 

- 

NR_104229 

NME7 

221772 

0.0854 

169115230 

169115430 

2 

140988996 

142889271 

- 

NM_018557 

LRPIB 

220073 

0.0106 

142668997 

142669198 

5 

57878939 

58147407 

+ 

NM_138453 

RAB3C 

221851 

0.1382 

58100790 

58101160 

5 

72848025 

72861512 

- 

NM_023039 

ANKRA2 

0 

0.8749 

72861395 

72861643 

5 

72861566 

72879203 

+ 

NM_001284430 

UTP15 

0 

0.4422 

72861395 

72861643 

5 

72861566 

72879203 

+ 

NM_001284431 

UTP15 

0 

0.4422 

72861395 

72861643 

5 

72861566 

72879203 

+ 

NM_032175 

UTP15 

0 

0.4422 

72861395 

72861643 

6 

121400640 

121655647 

- 

NM_152730 

TBC1D32 

132117 

0.0980 

121523281 

121523530 

6 

121400640 

121655647 

- 

NR_104452 

TBC1D32 

132117 

0.0980 

121523281 

121523530 

10 

131265454 

131565784 

+ 

NM_002412 

MGMT 

152400 

0.0673 

131417854 

131418055 

14 

89290978 

89344336 

+ 

NM_144596 

TTC8 

49741 

0.3767 

89340719 

89340919 

14 

89290978 

89344336 

+ 

NM_198309 

TTC8 

49741 

0.3767 

89340719 

89340919 

14 

89290978 

89344336 

+ 

NM_198310 

TTC8 

49741 

0.3767 

89340719 

89340919 

15 

50150435 

50411420 

- 

NM_024837 

ATP8B4 

190400 

0.0770 

50220820 

50221020 

15 

50150435 

50411420 

- 

NR_073596 

ATP8B4 

190400 

0.0770 

50220820 

50221020 

15 

50150435 

50411420 

- 

NR_073597 

ATP8B4 

190400 

0.0770 

50220820 

50221020 

15 

50150435 

50405514 

- 

NR_073598 

ATP8B4 

184494 

0.0788 

50220820 

50221020 

22 

25423941 

25593416 

+ 

NM_001145206 

KIAA1671 

65827 

0.1227 

25489768 

25489975 

TABLE  11:  GENES  NEAREST  MBDl- 
SPECIFIC  PEAKS  IN  SAMPLE  #6  NORMAL 


chromos 

location 

transcript 

start 

transcript 

stop 

strand 

Transcript  ID 

Gene 

Symbol 

Distance 
to  TSS 

Percent 
overlap 
with  qene 

Start 

Stop 

2 

101623690 

101767847 

- 

NM_001102426 

TBC1D8 

61030 

0.2178 

101706504 

101706817 

2 

232826293 

233208679 

+ 

NM_001257281 

DIS3L2 

192177 

0.0552 

233018470 

233018680 

2 

232826293 

233201909 

+ 

NM_152383 

DIS3L2 

192177 

0.0562 

233018470 

233018680 

2 

232826293 

233201909 

+ 

NR_046476 

DIS3L2 

192177 

0.0562 

233018470 

233018680 

2 

232826293 

233201909 

+ 

NR_046477 

DIS3L2 

192177 

0.0562 

233018470 

233018680 

6 

169857303 

170102160 

- 

NM_001202550 

WDR27 

213321 

0.0829 

169888637 

169888839 

6 

169857303 

170102160 

- 

NM_182552 

WDR27 

213321 

0.0829 

169888637 

169888839 

9 

138898383 

138987132 

- 

NM_144653 

NACC2 

19850 

0.2310 

138967078 

138967282 

15 

25923860 

26108350 

- 

NM_024490 

ATPlOA 

26610 

0.1100 

26081538 

26081740 

16 

9847265 

10276612 

- 

NM_000833 

GRIN2A 

296676 

0.0468 

9979736 

9979936 

16 

9847265 

10276264 

- 

NM_001134407 

GRIN2A 

296328 

0.0469 

9979736 

9979936 

16 

9856532 

10275925 

- 

NM_001134408 

GRIN2A 

295989 

0.0479 

9979736 

9979936 

TABLE  12:  MBDl-SPECIFIC  PEAKS  IN 
SAMPLE  #3  MALIGNANT  TISSUE 


Chromos 

Location 

Start 

Stop 

Interval 

Length 

Maximum 
Extended 
Reads  in 
Window 

Reads 

Per 

Million 

(RPM) 

Mann- 

Whitney 

p-value 

014_: 

total 

reads 

in 

region 

016_Input 
:  total 
reads  in 
region 

p-value 
(Sample  ID 
vs. 

016_Input) 

scaled  fold 
change 
(Sample  ID 
vs. 

016_Input) 

1 

121357889 

121358132 

243 

27 

22.5 

0.0059903 

31 

23 

0.170312 

1.3 

1 

149990803 

149991003 

200 

25 

20.8 

0.0005322 

24 

0 

5.95E-08 

25.0 

2 

41890203 

41890403 

200 

36 

30.0 

3.76E-05 

34 

1 

1.05E-09 

17.5 

2 

119691328 

119691528 

200 

26 

21.7 

l.OOE-06 

25 

0 

2.97E-08 

26.0 

2 

148129583 

148129869 

286 

29 

24.1 

0.0015445 

29 

0 

1.86E-09 

30.0 

3 

38759373 

38759739 

366 

29 

24.1 

1.31E-07 

29 

0 

1.86E-09 

30.0 

3 

137367585 

137367928 

343 

38 

31.6 

5.10E-07 

38 

3 

5.23E-09 

9.8 

4 

167020612 

167020912 

300 

77 

64.1 

6.66E-17 

77 

0 

6.58E-24 

78.0 

4 

174676149 

174676349 

200 

25 

20.8 

l.OOE-06 

25 

0 

2.97E-08 

26.0 

4 

183354639 

183354839 

200 

35 

29.1 

0.0045362 

30 

0 

9.29E-10 

31.0 

5 

11550571 

11550772 

201 

47 

39.1 

0.0172161 

47 

0 

7.08E-15 

48.0 

6 

61914121 

61914321 

200 

25 

20.8 

0.0016718 

30 

24 

0.248142 

1.2 

6 

87549450 

87549843 

393 

28 

23.3 

0.00362 

28 

0 

3.72E-09 

29.0 

7 

28401093 

28401293 

200 

38 

31.6 

4.06E-09 

36 

11 

0.0001727 

3.1 

7 

64868802 

64869021 

219 

31 

25.8 

0.0039565 

31 

1 

7.67E-09 

16.0 

7 

79816175 

79816546 

371 

35 

29.1 

6.63E-09 

35 

0 

2.90E-11 

36.0 

8 

52023643 

52024022 

379 

33 

27.5 

1.81E-08 

33 

0 

1.16E-10 

34.0 

8 

100207668 

100207868 

200 

27 

22.5 

6.07E-07 

27 

0 

7.44E-09 

28.0 

8 

121553804 

121554029 

225 

48 

40.0 

2.49E-08 

48 

0 

3.54E-15 

49.0 

8 

121556941 

121557141 

200 

66 

55.0 

6.61E-09 

67 

0 

6.74E-21 

68.0 

9 

66822750 

66822973 

223 

28 

23.3 

0.0144307 

28 

21 

0.195663 

1.3 

10 

42375378 

42375578 

200 

25 

20.8 

0.001981 

28 

21 

0.195663 

1.3 

11 

48587371 

48587571 

200 

29 

24.1 

9.19E-07 

29 

0 

1.86E-09 

30.0 

11 

90514294 

90514651 

357 

31 

25.8 

0.0143816 

31 

0 

4.65E-10 

32.0 

14 

105854291 

105854491 

200 

31 

25.8 

2.17E-07 

28 

0 

3.72E-09 

29.0 

16 

33955206 

33955451 

245 

34 

28.3 

0.0119406 

40 

13 

0.000134 

2.9 

16 

33957515 

33958134 

619 

49 

40.8 

0.021599 

96 

30 

1.52E-09 

3.1 

16 

46427489 

46427845 

356 

31 

25.8 

0.0148171 

37 

24 

0.0617977 

1.5 

18 

22882568 

22882768 

200 

27 

22.5 

0.0068005 

25 

0 

2.97E-08 

26.0 

18 

26119072 

26119285 

213 

31 

25.8 

8.33E-08 

30 

0 

9.29E-10 

31.0 

21 

10764032 

10764269 

237 

32 

26.6 

0.0120824 

33 

6 

7.14E-06 

4.9 

X 

131584340 

131584546 

206 

30 

25.0 

0.0001078 

30 

0 

9.29E-10 

31.0 

Y 

58884586 

58884874 

288 

28 

23.3 

0.0055728 

28 

25 

0.391723 

1.1 

TABLE  14:  GENES  NEAREST  MBDl- 
SPECIFIC  PEAKS  IN  SAMPLE  #4  MALIGNANT 

TISSUE 


chromos 

location 

transcript 

start 

transcript 

stop 

strand 

Transcript  ID 

Gene 

Symboi 

Distance 
to  TSS 

Percent 

overlap 

with 

qene 

Start 

Stop 

2 

132905164 

133015543 

- 

NR_027020 

ANKRD30BL 

-7193 

0.0000 

133022736 

133022947 

2 

133014539 

133014654 

- 

NR_031608 

MIR663B 

-8082 

0.0000 

133022736 

133022947 

2 

179390717 

179672151 

- 

NM_001256850 

TTN 

32106 

0.1155 

179639721 

179640045 

2 

179390717 

179672151 

- 

NM_001267550 

TTN 

32106 

0.1155 

179639721 

179640045 

2 

179390718 

179672151 

- 

NM_003319 

TTN 

32106 

0.1155 

179639721 

179640045 

2 

179390718 

179672151 

- 

N|V|_133378 

TTN 

32106 

0.1155 

179639721 

179640045 

2 

179390718 

179672151 

- 

NM_133432 

TTN 

32106 

0.1155 

179639721 

179640045 

2 

179390718 

179672151 

- 

NM_133437 

TTN 

32106 

0.1155 

179639721 

179640045 

2 

179609136 

179672151 

- 

Niv|_133379 

TTN 

32106 

0.5157 

179639721 

179640045 

57897237 

b/y /bbbZ 

- 

NM_001553 

IGFBP7 

72412 

0.2547 

57903939 

57904140 

57898404 

57976552 

- 

NM_001253835 

IGFBP7 

72412 

0.2585 

57903939 

57904140 

147628179 

147867035 

_ 

NM  031956 

TTC29 

50711 

0.1461 

147815976 

147816324 

5 

179233388 

179265078 

+ 

NM_001142298 

SQSTMl 

18127 

0.6342 

179251515 

179251715 

5 

179234003 

179265078 

+ 

NM_001142299 

SQSTMl 

17512 

0.6468 

179251515 

179251715 

5 

179247842 

179265078 

+ 

N|V|_003900 

SQSTMl 

3673 

1.1661 

179251515 

179251715 

6 

13363587 

13486416 

- 

NM_001242628 

GFODl 

75357 

0.1726 

13410848 

13411059 

6 

13363587 

13408370 

- 

NM_001242630 

GFODl 

-2478 

0.0000 

13410848 

13411059 

6 

13363587 

13487870 

- 

N|V|_018988 

GFODl 

76811 

0.1706 

13410848 

13411059 

6 

131160488 

131384463 

- 

NM_001135554 

EPB41L2 

34314 

0.1152 

131349892 

131350149 

6 

131160488 

131384463 

- 

NM_001135555 

EPB41L2 

34314 

0.1152 

131349892 

131350149 

6 

131160488 

131384463 

- 

Niv|_001199388 

EPB41L2 

34314 

0.1152 

131349892 

131350149 

6 

131160488 

131384463 

- 

NM_001431 

EPB41L2 

34314 

0.1152 

131349892 

131350149 

7 

29234121 

29553945 

+ 

NM_004067 

CHN2 

206472 

0.0688 

29440593 

29440812 

8 

135812763 

135812851 

- 

NR_029666 

MIR30B 

-7678 

0.0000 

135820529 

135820907 

8 

135817119 

135817189 

- 

NR_029599 

MIR30D 

-3340 

0.0000 

135820529 

135820907 

9 

21277687 

21278563 

- 

NR_036676 

IFNA22P 

-9365 

0.0000 

21287928 

21288322 

9 

119187504 

119449543 

- 

NM_001184734 

ASTN2 

105835 

0.0767 

119343508 

119343708 

9 

119187504 

120177318 

- 

NM_014010 

ASTN2 

833610 

0.0203 

119343508 

119343708 

9 

119187504 

119449495 

- 

NM_198186 

ASTN2 

105787 

0.0767 

119343508 

119343708 

9 

119187504 

119449543 

- 

NM_198187 

ASTN2 

105835 

0.0767 

119343508 

119343708 

9 

119187504 

119449543 

- 

NM_198188 

ASTN2 

105835 

0.0767 

119343508 

119343708 

10 

76586171 

76792381 

+ 

NM_001256468 

KAT6B 

202590 

0.0975 

76788761 

76788961 

10 

76586171 

76792381 

+ 

NM_001256469 

KAT6B 

202590 

0.0975 

76788761 

76788961 

10 

76586171 

76792381 

+ 

NM_012330 

KAT6B 

202590 

0.0975 

76788761 

76788961 

12 

44229664 

44783546 

+ 

NM_001286211 

TMEM117 

321060 

0.0462 

44550724 

44550979 

12 

44229664 

44783546 

+ 

NM_001286212 

TMEM117 

321060 

0.0462 

44550724 

44550979 

12 

44229664 

44783546 

+ 

NM_001286213 

TMEM117 

321060 

0.0462 

44550724 

44550979 

12 

44229664 

44783546 

+ 

NM_032256 

TMEM117 

321060 

0.0462 

44550724 

44550979 

12 

120941082 

120966965 

- 

N|V|_032314 

COQ5 

19128 

0.7765 

120947637 

120947837 

14 

55034330 

55260034 

+ 

NM_001161576 

SAMD4A 

166374 

0.1015 

55200704 

55200932 

14 

55034330 

55260034 

+ 

NM_015589 

SAMD4A 

166374 

0.1015 

55200704 

55200932 

15 

52599480 

52821248 

- 

N|V|_000259 

MY05A 

4142 

0.1429 

52816790 

52817106 

15 

52599480 

52821248 

- 

NM  001142495 

MY05A 

4142 

0.1429 

52816790 

52817106 

15 

53805938 

54051860 

- 

NM_182758 

WDR72 

110839 

0.0817 

53940821 

53941021 

15 

53805938 

54055076 

- 

NR_102334 

WDR72 

114055 

0.0807 

53940821 

53941021 

16 

33961052 

33962504 

- 

NR_038368 

LINC00273 

1541 

0.0000 

33960621 

33960963 

17 

20483037 

20484225 

+ 

NM_001190790 

CDRT15L2 

2679 

0.0000 

20485716 

20485916 

17 

30819628 

31203903 

- 

N|V|_015194 

MYOID 

362590 

0.0523 

30841113 

30841313 

17 

79762010 

79771890 

+ 

NM_000160 

GCGR 

5058 

2.1354 

79767068 

79767278 

20 

2082528 

2129199 

+ 

NM_080836 

STK35 

18104 

0.5806 

2100632 

2100902 

X 

85403455 

86087606 

+ 

NM_001139514 

DACH2 

288284 

0.0294 

85691739 

85691939 

X 

85403455 

86087606 

+ 

NM_053281 

DACH2 

288284 

0.0294 

85691739 

85691939 

X 

85517971 

86087606 

+ 

NM_001139515 

DACH2 

173768 

0.0353 

85691739 

85691939 

TABLE  15;  GENES  NEAREST  MBDl-SPECIFIC 
PEAKS  IN  SAMPLE  #6  MALIGNANT  TISSUE 


chromos 

location 

transcript 

start 

transcript 

stop 

strand 

Transcript  ID 

Gene  Symbol 

Distance  to 

TSS 

Percent 

overlap 

with 

gene 

Start 

Stop 

2 

132905164 

133015543 

- 

NR_027020 

ANKRD30BL 

-8377 

0.0000 

133023920 

133024279 

2 

133014539 

133014654 

- 

NR_031608 

MIR663B 

-9266 

0.0000 

133023920 

133024279 

7 

156230483 

156238283 

- 

NR_038232 

LOC285889 

-4792 

0.0000 

156243075 

156243275 

13 

96743093 

97491817 

+ 

NM_153456 

HS6ST3 

674969 

0.0279 

97418062 

97418270 

14 

93799565 

94173690 

+ 

NM_020818 

UNC79 

75169 

0.0540 

93874734 

93874935 

17 

9153788 

9479276 

- 

NM_004853 

STX8 

27909 

0.0621 

9451166 

9451367 

17 

9153788 

9479276 

- 

NR_033656 

STX8 

27909 

0.0621 

9451166 

9451367 

18 

109065 

122223 

+ 

NR_033770 

ROCKIPI 

1624 

3.7237 

110689 

111178 

18 

112256 

112340 

- 

NR_107045 

MIR8078 

1162 

0.0000 

110689 

111178 

18 

29843484 

30050448 

- 

NM_001242409 

GAREM 

51507 

0.1227 

29998688 

29998941 

18 

29843484 

30050448 

- 

NM_022751 

GAREM 

51507 

0.1227 

29998688 

29998941 

21 

14410487 

14490572 

+ 

NR_026916 

ANKRD30BP2 

-2013 

0.0000 

14408274 

14408474 

